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Abstract 



The computationally efficient vector quantization for a discrete time source can be performed using lattices over 
t/3 , block linear codes or convolutional codes. For high rates (low distortion) and uniform distribution the performance 

of a multidimensional lattice depends mainly on the normalized second moment (NSM) of the lattice. For relatively 
low rates (high distortions) and non-uniform distributions the lattice-based quantization can be non-optimal in terms 
of achieving the Shannon rate-distortion function H{D). 
^ , In this paper we analyze the rate-distortion function R{D) achievable using linear codes over GF{q), where 

g is a prime number We show that even for q = 2 the NSM of code-based quantizers is close to the minimum 
^■f^ , achievable limit. If q ^ oo, then NSM l/(27re) which is the NSM of the infinite dimension sphere. By 

^SJ ' exhaustive search over q-ary time-invariant convolutional codes with memory < 8, the NSM-optimum codes for 

q = 2,3,5 were found. 

' For low rates (high distortions) we use a non-lattice quantizer obtained from a lattice quantizer by extending 

CO . the "zero zone". Furthermore, we modify the encoding by a properly assigned metric to approximation values. Due 

' to such modifications for a wide class of probability distributions and a wide range of bit rates we obtained up to 

1 dB signal-to-noise ratio improvement compared to currently known results, 
•i— I , 

^ Index terms — Arithmetic coding, convolutional codes, lattice, lattice quantization, normalized second 
■ moment, random coding, rate-distortion function, scalar quantization, vector quantization 

I. Introduction 

For an arbitrary discrete time memoryless continuous stationary source X = {x}, approximation 
alphabet Y = {y}, and a given distortion measure d{x,y) the theoretical limit of the code rate R under 
the condition that the average distortion does not exceed D is determined by the Shannon rate-distortion 
function H{D) [1], 

H(D)= min {I(X;Y)} 

{f{y\x)}:E[d(x,y)]<D' 

where I{X; Y) denotes the mutual information between X and Y. 

An encoder of a discrete time continuous source is often referred to as a vector quantizer, and the code 
as a codebook [2]. Coding theorems about achievability of H{D) are proved by random coding techniques 
without imposing any restrictions on the codebook structure. The encoding complexity for such a codebook 
in general is proportional to the codebook size; i.e., the encoding complexity for sequences of length n 
with rate R bits per sample is proportional to 2^". 
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A large step towards reducing the quantization complexity is due to using quantizers based on mul- 
tidimensional lattices [3]. In particular, Conway and Sloane in [4] and [5] proposed a simple encoding 
algorithm, which, in essence, requires a proportional to dimension n of a codebook number of scalar 
quantization operations and subsequent exhaustive search over the lattice points in vicinity of the source 
sequence. The complexity of this quantization still grows exponentially in the dimension n of the vectors, 
but much slower than 2^". 

Important results related to the vector quantization efficiency are obtained by Zador [6], who has 
shown that, in the case of small quantization errors, the characteristics of the quantizer depend on a 
parameter Gn which is determined by the shape of the Voronoi polyhedrons of the codebook entries. For 
lattice quantizers, i.e., when all Voronoi regions are congruent to each other, Gn is equal to the so-called 
normalized second moment (NSM) of the lattice (see ^ in Section 2 below). 

Constructions of lattices with good values of NSMs are presented in [3]. For example, for the Leech 
lattice A24 NSM = 0.0658. Since for lattice Z (the set of integers) NSM = 1/12, the so called "granular 
gain" of a lattice quantizer in dB is computed as 10 logio(12NSM). Therefore the granular gain of A24 with 
respect to Z is equal to 1.029 dB which is 0.513 dB away from the theoretical limit 10 log^o(12/27re) = 
1.54 dB corresponding to covering the Euclidean space by n-dimensional spheres when n ^ 00. 

Granular gains achieved by different code constructions were reported in [7]. In particular, the Marcellin- 
Fischer [8] trellis-based quantizers are presented in [7] as an NSM record-holder. For example, by a trellis 
with 256 states at each level MSN=1.36 dB is achieved (only ?»0.17 dB from the sphere-covering bound). 

The Marcellin-Fisher [8] quantizer nowadays is still the best-known in the sense of granular gain. 
It means that these quantizers are good for uniformly distributed random variables. Meanwhile results 
reported in [8] for the Laplacian and Gaussian distributions were later improved in the papers [9], [10], 
[11], and [12]. Furthermore, the best currently known results for the generalized Gaussian distribution 
with parameter a = 0.5 is presented in [13]. 

Despite numerous attempts to construct good quantization schemes, the gap between asymptotically 
achievable results and the best existing quantizers is still large. For example, for 32-state trellises and bit 
rate 3 bits/sample, this gap is equal to 1.10, 1.53, and 1.78 dB for the Gaussian distribution, Laplacian 
distribution, and generalized Gaussian distribution with a = 0.5, respectively [13]. 

Since the best codes and coding algorithms for non-uniform distributions are not the same as for the 
uniform distribution, the following question arises: could it be that codes which deliberately are not good 
in sense of granular gain be very good for a non-uniform distribution? 

The positive answer to this question is one of the results of this paper. Below we present the coding 
scheme which with a 32-state trellis at 3 bits/sample is not more than 0.32, 0.35, and 0.44 dB away from 
the Shannon limit for the three distributions mentioned above. Therefore, the achieved gain over existing 
trellis quantizers exceeds 1 dB at high rates for a wide class of probability distributions. 

We have searched good quantizers among lattice quantizers over linear codes. The motivation is that the 
constructions of some good lattices are closely related to the constructions of good block error-correcting 
codes (e.g., the Hamming code and Golay codes). One more argument in favor of such trellises appears in 
[14], [15] and [16], where it was shown that there exist asymptotically optimal lattices over g-ary linear 
codes as q tends to infinity. 

Although the NSM decreases (the granular gain increases) with alphabet size q, it was found by simu- 
lations that the quantization performance behavior for non-uniform quantization is different: binary codes 
appear to be better than codes over larger fields. Replacing the quaternary codes, typically considered the 
in trellis-based quantization literature, by binary codes is the first step towards improving the quantization 
performance. 

Another step is based on the important observation (see, e.g., [17], [18]) that the entropy-coded uniform 
scalar quantization with properly chosen reconstruction values provides near the same coding efficiency as 
the optimum entropy-constrained scalar quantization (ECSQ) does. Moreover, choosing zero quant larger 
than others, we obtain a simple scalar quantizer with performance extremely close to the ECSQ for all 
rates. In [18] this quantizer is called Extending Zero Zone (EZZ) quantizer. Notice that EZZ quantization 
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does not follow the minimum Euclidean distance criterion. Therefore, it would be reasonable not to use 
the Euclidean distance for vector quantization (or trellis quantization as a special case) as well. 

We begin in Section 2 with reviewing the random coding bounds [14], [15] on the NSM value for linear 
code based lattices. The asymptotic behavior of the NSM as a function of the code alphabet size is studied 
and we show that code-based lattices are asymptotically optimal when the alphabet size tends to infinity. 
In Section 3 the code search results are presented. In Section 4 we present entropy-constrained code- 
based quantization. By generalizing the EZZ approach to lattice quantization, we obtained near optimum 
SNR values for all code rates for the classes of non-uniform probability distributions mentioned above. 
Concluding remarks are given in Section 5. 

II. Code-based quantization 
An n-dimensional lattice can be defined by its generator matrix G G M"^" as [3] 

A(G) = {A e W\3z e Z" : A = Gz}. (1) 

Let g be a prime number and C denote g-ary linear (n, k) code. Then a lattice over the code C is 
defined as 

A(C) = {A G Z"i3c G C : A = c(mod q)}. (2) 

In other words, the lattice A(C) consists of integer-valued sequences which are equal to codewords from 
C by modulo q. In the case of binary codes the least significant bits of the sequences A G A(C) are 
codewords of C. 

The lattice defined by Q is known as Construction A [3]. 

When a n-dimensional lattice is used as a codebook, the source sequence x G is approximated by 
some lattice point A G A(C). The whole set A(C) is assumed to be ordered and a number representing the 
optimal approximation point A is transmitted instead of x. Therefore, we need some rule which assigns a 
number to each point of A(C). To suggest such a rule we consider the following representation of A(C): 

A(C) = U {gZ" + c^}. (3) 

m=l 

In other words, A(C) is a union of q'' cosets which leaders are coderwords of C. This representation 
allows a one-to-one mapping of all A G A(C) onto the set of integer vectors (m, b), where m is a 
codeword number and b = (bi, 6„) G Z„ defines the point number in gZ"^. The lattice point A can be 
reconstructed from (m, b) by the formula 

A = + qb. (4) 

The lattice A(C) or its scaled version aA(C), a > can be used as a codebook for vector quantization. 
Below we always assume a = 1 since instead of quantizing the random vector x with aA(C) we can 
quantize x/a with A(C). We assume also that 

• The stationary memoryless source sequences are drawn from X" = {x\x = {xi, . . . , C M" with 
a known probability density function (pdf) f{x) = HiLi fi^i)- 

. The squared Euclidean distance 



d{x,y) = 




is used as a distortion measure. 



4 



The quantization procedure can be described as a mapping y = Q{x), where x G X", y G A(C). The 
mean square error (MSE) of the quantizer 

D = -M[\\x-Q{x)\\^]=-y2 [ \\x-K\\^f{x)dx (5) 
is minimized by the mapping 

Q{x) = Aj if X eTZi 
where TZi are Voronoi regions [3] of lattice points, i.e., 

TZi = n{Xi) = {xeW -.Wx-XiW^ <\\x- Ajlp Vj 7^ i}. 

All Voronoi regions of lattice points are congruent to each other. We denote the Voronoi region which 
contains the origin as IZ. We use the conventional notations 



V(7^) = I dx 
Jtz 



G-PJ) = -^Sl^ (6) 

for the volume and the normalized second moment (NSM) of 7^, respectively. 

By entropy coding the rate R of the lattice quantizer can be arbitrary close to the entropy of the lattice 
points 

^ = — y^PiiogPi, 

n ^-^ 

i 

where the probabilities of the lattice points are equal to 



Pi=p{Xi)= fix)dx, i = l,2, 



The rate-distortion function R{D) of the lattice quantizer can be easily estimated under the assumption 
that the quantization errors are so small that the pdf f{x) can be approximated by a constant value inside 
each region TZ(Xi). In this case, the rate can be expressed via the differential entropy of the source 



h{X) = - J f (x) log f{x)dx 
and the volume V{TZ) of the Voronoi region 71, 

R(D) < h(X) - -logVin). (7) 

n 

For the average distortion value D from ^ and ^ we have 

D = Gn{n)V(nf/''. (8) 
From the last two equations and the Shannon lower bound, 

H{D) > h{X) - - log(27reL)) 

we obtain the inequalities 

H{D) < R{D) < H{D) + ^ log (27^eG„(7^)). (9) 

Therefore to prove that the optimal quantization would be asymptotically achievable by lattice quanti- 
zation (i.e., R{D) H{D), if n ^ oo) we need to prove that GniTZ) l/(27re) when n ^ oo. 
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TABLE I 

Random coding upper bounds on NSM for optimum code rate and for code rate i?c = 1/2 



q 


Ro 




Rc = -Ro 


-Rc = 1/2 


2 


0.4144 


0.0598 


0.0631 


3 


0.4633 


0.0587 


0.0592 


5 


0.5000 


0.0586 


0.0586 


7 


0.5000 


0.0585 


0.0585 


oo 


0.5000 


0.0585 


0.0585 



Consider a lattice A(C) over a g-ary linear (n, k) code, and denote the code rate -Rc = k/n. It can be 
shown (see [14]) that the overall volume of the q'' Voronoi regions of the codewords of C coincides with 
the volume of the n-dimensional cube with edge length equal to q. It means that V{TZ) = q"'/q'' = 
and from ^ follows 

D = Gn{n)q^^^-^^\ (10) 



Formally we cannot use equation (ITOI) for finding NSM for a lattice over a code by estimating the 
quantization error for a random variable uniformly distributed over the cube. The reason is that the n- 
dimensional cube with length of the edge q does not contain precisely q^ complete Voronoi regions. Some 
regions near the boundaries of the cube protrude from the cube. To solve this problem the "cyclic" metric 

p(x,y) = min{(x - ^/)^ (|x - y\ - qf) (11) 



was introduced in [14]. It was shown that the NSM of the lattice satisfies ([101) if the lattice points of the 
cube are used as codebook entries and the distortion D is computed with respect to the metric p(-, ■). 

Below we will use ([TOl) to estimate the NSM of convolutional codes and in [14] this equation was used 
to obtain the random coding bound on the achievable NSM value. The reformulated result of [14] (see 
comments in Appendix) is presented here without a proof. 

Theorem 1: For any prime number q and any e > 0, there exists a sequence Ci, C2, C3 . . ., of q-ary 
linear codes, where C„ is a code of length n, and such a number N, that for all n > 

|G„(g)-rfog'^^"^''~'^| (12) 

where Gn{q) is the second normalized moment of the lattice based on the code C„, is a solution of 
the equation 

"1/2 i:^g[s,x) 



d = 2 I % —dx 

g{s,x) 



(13) 



and Ro{q) is defined as 

i?o(g) = ^^("2-2 \ngil/i2do),x)dx) (14) 



where 

g{s,x) 



1 

q ^-^ 



k=0 



and p(-, ■) is defined by ([TT]) . 

The values Goo{q) = linin^oo Gn{q) for g = 2, 3, 5, 7 are given in TableHIl Also we show in the table the 
code rate value -Rc = -^0(9) which minimizes the random coding estimate on the NSM value Goo{q) and 
the NSM value achievable when we consider codes with nonoptimal rate -Rc = 1/2. Presented estimates 
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give the achievable NSM values if long enough codes are used for quantization. In particular, the search 
results presented in Section |lll] show that the lattices obtained from convolutional codes with a number 
of encoder states above 512 are roughly 0.2 dB away from the asymptotically achievable NSM value for 
a given alphabet size q. On the other hand, rate Rq = 1/2 binary codes with 64 states have even better 
NSM than the NSM computed by averaging over randomly chosen infinitely long rate -Rc = 1/2 codes. 

An analysis of the asymptotic behavior of Gooio) and -Ro(?) with g — > oo leads to the following result: 

Corollary 1: 

limGoolg) = 1^ (16) 
limi?o(g) = (17) 

The proof is given in Appendix. 



III. NSM-OPTIMUM CONVOLUTIONAL CODES 

It follows from Corollary 1 that codes with rate -Rc = 1/2 provide asymptotically optimum NSM value 
for large code alphabet size q. It follows from the data in Table HI] that codes with rate Rc =1/2 give 
near-optimum NSM except for q = 2. Below we consider only rate Rc = 1/2 codes since they are most 
convenient for practical applications. 

Moreover, we will search for good linear codes for quantization among truncated convolutional codes, 
since for such codes the quantization (search for a codeword closest to the source sequence) can be easily 
performed using the Viterbi algorithm. 

A g-ary, rate Rc = 1/2 convolutional encoder over GF{q) with memory m can be described by the 
polynomial encoding matrix 

G{D) = {g,{D) g^m 

where 

gi(D)=gio + gaD + ---+g,mD"', i = l,2 

are g-ary polynomials such that maxj {deg gi{D)} = m. 

In polynomial notations the infinite length input information sequence 

u{D) =uo + uiD + ■■■ + UiD' + ■■■ 

and the corresponding infinite length codeword 

v{D) = Vo + viD H h ViD' H , where Vi = {v}, v^) 

satisfy the matrix equation 

v{D) =u{D)G{D). (18) 

The transformation u{D) to v(D) is invertible if at least one of two polynomials is delay-free (gio = 1 
or g2o = 1). The convolutional encoder (fTSi) is non-catastrophic if gi{D) and g2{D) are relatively prime 
([19]). Since for any convolutional code both catastrophic and non-catastrophic encoders do exist, we can 
consider only non-catastrophic encoders without loss of optimality. 

For any polynomial a{D) = ao + ciiD + ■ ■ ■ + ajD* + • ■ ■ let [a(D)]" = ao + aiD + ■ — h a„-D" denote 
the finite degree polynomial obtained from a{D) by truncating it to degree n. 

Furthermore we consider g-ary linear (n, k = n/2) codes which codewords are coefficients of truncated 
vector polynomials [i;(D)]"/^ obtained from degree k = n/2 information polynomial u{d) by encoding 
according to (fTSi) . 

The trellis representation of (n, n/2) binary linear code obtained from (1 + D + D'^, l+D"^) convolutional 
encoder is shown in Fig.[T] For g-ary codes, g > 2, similar trellises can be constructed but with g branches 
leaving each node and merging in each node of the trellis diagram. 
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Fig. 1. Trellis representation of (n, n/2) binary linear code obtained from the (1 + , 1 + D + D^) convolutional encoder 

TABLE II 

Optimum binary convolutional codes and their NSMs. The generators are given in octal notation. 



states num. 


generator 


Gn 


En. gain, dB 


2 


[3;1] 


0.0733 


0.5571 


4 


[7;5] 


0.0665 


0.9800 


8 


[1V;13] 


0.0652 


1.0657 


16 


[31;23] 


0.0643 


1.1261 


32 


[61;57] 


0.0634 


1.1873 


64 


[165;127] 


0.0628 


1.2286 


128 


[357;25I] 


0.0623 


1.2633 


256 


[625;467] 


0.0620 


1.2843 


512 


[I207;1I7I] 


0.0618 


1.2983 


oo 




0.0598 


1.4389 



To find the NSM for a {gi{D) , g2{D)) convolutional encoder we first estimate the quantization error 
for source sequences uniformly distributed in the hypercube [0,g]'\ when the "cyclic metric" •) (fTTI) 
is used as the distortion measure (see [14]). Then this error is recalculated into the NSM value using (fTOl) . 

To speed up the exhaustive search for the optimum encoders over the set of all memory m encoders 
with generators {gi{D) , g2{D)) we used the following rules to reduce the search space without loss of 
optimality, 

• Only delay-free polynomials are considered g^ = g^ = 1. 

• Since the codes with the generators {gi{D), g2{D)) and {g2{D), gi{D)) are equivalent, we con- 
sider only encoders with deg{gi{D)) > deg{g2{D)). If deg{gi{D)) = deg{g2{D)) then we require 
\giiD)\ > \g2{D)\, where we denote 

deg(o(r>)) 

\am= E 

In other words, we interpret the polynomial coefficients as positions of a number written in g-ary form. 
Thereby we we assign an integer number (value) to each polynomial and only pairs of generators 
sorted in descending order of the corresponding values are used as candidates. 

• Only non-catastrophic encoders are considered, i.e., gcd{gi{D) , g2{D)) = 1. 

Our search results are presented in Tables HIl [nil and |W] for q =2,3, and 5, respectively. It is easy to 
see that the NSM values of the optimum codes decrease when q grows, but the improvement provided by 
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TABLE III 

Optimum ternary convolutional codes and their NSMs. 



states num. 


generator 


Gn 


En. gain, dB 


3 


[12;11] 


0.0720 


0.6349 


9 


[121;111] 


0.0663 


0.9931 


27 


[1211;1112] 


0.0641 


1.1396 


81 


[11222;10121] 


0.0626 


1.2424 


243 


[110221;101211] 


0.0617 


1.3053 


729 


[1000112;112122] 


0.0614 


1.3265 


oo 




0.0586 


1.5231 



TABLE IV 

Optimum quinary convolutional codes and their NSMs. 



states num. 


generator 


Gn 


En. gain, dB 


5 


[14;13] 


0.0716 


0.6591 


25 


[131;102] 


0.0642 


1.1328 


125 


[1323;1031] 


0.0622 


1.2703 


625 


[10314;10133] 


0.0613 


1.3336 


oo 




0.0585 


1.5229 



ternary or quinary codes over binary codes is small and does not worth the complication of the encoding. 
Notice also, that the determined NSM values correspond to less shaping gain than 1.36 dB reported in 
[8] for the 256-states quaternary codes. To achieve the same shaping gain we have to have NSM=0.0609 
whereas the best found 625-state 5-ary code provides NSM=0.0613 which is approximately 0.02 dB worse 
than the value for the Marcellin-Fischer code. 

Notice, however, that the Marcellin-Fischer quantizer [8] is not a lattice quantizer. Therefore, the NSM 
values reported here are the best among known multidimensional lattices. 

IV. Arithmetic coded lattice quantization 

Let us consider the lattice over a linear (n, k) code C (not necessarily a truncated convolutional code). 
The entropy quantization of the input sequence x can be performed in two steps: 

• Approximation, i.e., finding the vector of indices b and the codeword number m E {0, — 1} 
that minimizes the distortion value d{x, A) = ?7,~^||a; — A|p = — (c^ + Qb)\\'^ (see dH)). 

• Entropy lossless coding of the pair (m,6). 
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First, we consider the approximation step. From @ the following reformulation of the encoding problem 
follows. 



mm 

A 



mm mm 

m b 



mm mm 

m b 



X 



y^jxt - Cmt - qbt 



t=i 



mm 



mm 



t=i 

n 

t=i 



min 



<lhf] 



min{;u(a;, c^)} . 



(19) 



Here we introduced the additive metric 



t=i 

min I (x — c — qb)'^^ 



[X 



qbo 



(20) 



where the optimum index value bo can be determined by "scalar quantization" with step q applied to 

(x - c). 



'Jo 



X 



q 



(21) 



By (■) we denote rounding to the nearest integer. 

It follows from (fT9l) , (|20l) . and (|2TI) that lattice quantization can be split into two steps. First, for all 
t = 1, ...,n and c = 0,...,g — 1, the values Xt — c should be scalar quantized and the nx q indices 6^ and 
the corresponding metrics fitc = {xt — c — qbtcY are to be computed. In the second step, the codeword Cm 
yielding the minimum of the metrics ii{x, c„J among all codewords G C, m G {0, q^ 



be found. The codeword number m and vector of indices b = {ptc^^, ■■■,btc„ 
closest to X. The formal description of the algorithm is presented in Fig. |2l 



1} should 
describe the trellis point 



The problem of searching for the optimum entry over the infinite cardinality multidimensional lattice 
codebook is now divided into two steps: qn scalar quantization operations followed by searching for the 
best codeword among the q'' codewords of the g-ary linear (n, k) code. The latter step is similar to soft 
decision maximum-likelihood decoding of error correcting linear block codes and there are many such 
decoding algorithms known; in particular for truncated convolutional codes the Viterbi algorithm is a 
reasonable choice. 

Now let us consider an algorithm for a variable-length lossless coding of the pair (m, b) obtained on 
the approximation step. 

For a rate Rq = 1/2 truncated convolutional {n, k = n/2)-code, the codeword is determined by length 
k information sequence u = (ui, ...,Uk) and the codeword can be represented as a sequence of k tuples 
of length 2, i.e., c = (ci, Ck), Ct = {cj, c^). We write the corresponding vector of indices in tuples of 
length 2 too, i.e., b = {bi, bk), bt = {b], bf). 

Denote by at = {ut-i, ...,Ut-m) the encoder state at time t, where we assume Ut = for t <0. 

To describe arithmetic coding (for details see [20]) it is enough to define a probability distribution on 
the pairs (n, b). Thus we introduce the following two probability distributions, 

{i}{ut\(Tt)=^{u\(T), O-GGF^} 
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Input: Source sequence x = {xi,X2, ■ ■ ■ , 

Output: Number m of codeword E C, index sequence b = (bi,b2, ■ ■ ■ ,bn). 

Metrics and indices computation: 
for t = 1 to n do 

for c = to g — 1 do 




Utic) = (xt -c-qht{c)f 
end 
end 

Codeword search: 

Find the codeword = (c^i, 0^2, • • • , Cmn) that minimizes metric ^{x, c^) = YJt=i fJ'ticmt)- 
Result : 

The algorithm outputs the codeword number m and the index vector 

b = (6l(Cmi), b2{Cm2), • • • , bniCmn))- 



Fig. 2. Approximation step of lattice quantization. 



{ip{b]\ci) = Vib\c), beZ,ce G¥„ i = 1, 2} 

which we estimate for a given source pdf by training using a long simulated data sequence. The required 
probability distribution p(n, b) is defined as 

k 

p{u,b) = ]l^iu,\aMbl\clMb',\c',) 

t=i 

where code symbols cl,i = 1,2, are uniquely defined by Ut and at. 

Notice that similar entropy coding for trellis quantization was described in [21], [8], and [13]. The 
difference is that in these papers the approximation alphabet is finite, whereas in our algorithm the 
approximation alphabet is not necessary finite since we allow any lattice point to be an approximation 
vector. 

Simulation results of the described arithmetic-coded quantization will be presented in next section. 
Below we present a modification which is efficient for low quantization rates for non-uniform probability 
distributions. 

As it was mentioned in the Introduction, the codes which are good in terms of granular gain (NSM value) 
are not necessarily optimal for non-uniform distributions. The reason is that spheres are not necessarily 
good Voronoi regions in this case. Moreover, Voronoi regions are not optimal quantization cells in this 
case. To understand this better, we consider properties of optimal entropy-constrained scalar quantization. 

First of all, notice that for a wide class of probability distributions the simulation results for optimal 
non-uniform scalar quantization are presented in [17]. It is easy to verify that, for example, for GGD 
random variables with parameter a < 0.5, for all bit rates the best trellis quantizers loose with respect to 
optimum scalar quantizers [13]. The analysis of optimum and near-optimum scalar quantizers from [17], 
[18] show that: 

• For GGD random variables the optimum ECSQ quantizer has the larger quantization cells near zero 
and smaller cells for large values. 

• The quantization thresholds are not equal to half of the sum of the neighboring approximation values. 
Therefore, the minimum MSE (or minimum Euclidean distance) quantizers are not optimum ECSQ 
quantizers. 
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Input: Source sequence a; = (xi, X2, . . . , 

Output: Number m of codeword Cm G C, index sequence b = (61, 62, • 


• 


Metrics and indices computation: 
for t = 1 to n do 

e = o; 

if X( < — A then 

^ = Xt + A 
end 

if X( > A then 

^ = xt-A 
end 

for c = to g — 1 do 




h / 

- \ — / 

= {^-c-qbt{c)f 

end 
end 




T«T "V" ^ r~i • 

Find the codeword Cm = (cmi, 0^2, • • • , Cmn) that minimizes metric fj,{x, 


Cm) = Y2t=l f^t{Cmt) ■ 


Result : 

The algorithm outputs the codeword number m and the index vector b = 


= {bi{Ci),b2{c2),...,bn{Cn)). 


Fig. 3. Approximation withi an extended zero zone 



• The quantizers which have all quantization cells of equal size except the one symmetrically located 
around zero (so called "extended zero zone" (EZZ) quantizer), has performances very close to that 
of the optimum ECSQ quantizer . 
It follows from these observations that good trellis quantizers should be obtained as a generalization of 
the EZZ quantizer to the multidimensional case. (Another method of extending the zero zone for trellis 
quantization, so called New Trellis Source Code (NTSC), have been presented in [13]). 

To describe the multidimensional EZZ quantizer we need to generalize (l20l) and (fT9l) to the case when 
the input data from some interval [—A, A], A > are artificially interpreted as zeroes. Therefor we replace 
each input r.v. x by another r.v. defined as 

{X + A X < -A 
0, -A < X < A (22) 
X - A X > A. 

Then we apply (|20|) and (fT9l) to the r.v. ^. The generalized version of the algorithm describing the 
approximation step of lattice quantization is shown in Fig. [3l The parameter A is different for different 
pdf's and quantization rates. For each particular pdf and rate we minimize average distortion value by 
selecting the best value of A from the set 2""*, s = 0, 1, 2, 3. Simulation results will be presented in next 
section. 



Now we consider the reconstruction of the approximation vector y = {yi, Un) by the decoder from 
the received pair (m, b = (61, 6„)). First, we introduce an additional integer parameter L which defines 
the amount of approximation values which are estimated using a long training sequence (in our simulations 
we used L = 10). For the training source sequence x = {xi, ...,xn) we performed the lattice quantization 
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described above and found the optimum codeword c = {ci,...,cn) and the corresponding sequence of 
indices b = {bi, b^). Then for all b E {—L, —1, 1, ...L} and c G 0, . . . , g — 1 we computed "refined" 
approximation values 

r 0, 



Pcb = < 



6 = 0, c = 
+ A, 6g{1,...L}, cG {0,...,g-l} or 
6 = 0, cG{l,...,g-l} 
^^-^-■^^ A, 6g{-L,...,-1}, cG{0,...,g-l} 



\Jcb 



cb\ 



yt 



where Jcb = {t\ct = c,bt = b}. The set of values Pcb is assumed to be known to decoder. 

When the pair (m, b = (fei, is received, the decoder first finds the codeword 
and then reconstructs the approximation vector y = {yi, ...,yn) as 

Cmt + qbt 6* ^ {-L, -1, 1, ...L}. 

V. Simulation results 

This section contains the results of the arithmetic coded lattice quantization for different types of 
sources. We consider the parametric class of generalized Gaussian distributions. The probability density 
function is parametrized by the parameter a and expressed as 



where 



2r(l/a) 

ri{a, a) = a"^ 
V[z)- 



exp{-[r/(a,a)|x|]"} 
r(3/a) T 

r(i/«). 



a > 



and cr denotes the standard deviation. The Laplacian and Gaussian distributions are members of the family 
of generalized Gaussian distributions with parameters a = 1.0 and a = 2.0, respectively. 

Simulation results for the Gaussian distribution are presented in Table |Vl The SNR achieved by the 
"New Trellis Source Code" (NTSC) [13] and by the "Entropy Constrained Scalar Quantization" (ECSQ) 
[17] are also shown in the table. We can see that the estimated SNR value for the encoder with 512 
states is only about 0.2 dB below Shannon limit. For encoders with 4 and 32 states we obtain significant 
improvements over the NTSC-based quantization. 

Simulation results for the Laplacian distribution and for the GGD with a = 0.5 are presented in Tables 
rvTl and IVII[ respectively. We show the quantization efficiency both without EZZ (A = 0) and with EZZ 
(A > 0). Also data from [13] are given in parenthesis. It follows from the simulation results that the EZZ 
is efficient for low quantization rates (below 2 bit/sample) and provides an SNR gain of about 0.1 dB 
for the Laplacian source and about 0.4 dB for the GGD with a = 0.5. The gap between the achieved 
SNR and the theoretical limit H{D) for these two probability distributions is nearly the same: 0.2 - 0.3 
dB. Notice also, that for the GGD with a = 0.5, low-complexity (less than 32 encoder states) low rate 
quantizers do not perform better than the ECSQ. Therefore, constructing efficient quantization algorithms 
for low rate coding for the GGD with small a remains to be an open problem. 



TABLE V 

Gaussian source. Data for NTSC [13] in parenthesis. 



Rate 


Trellis states 


H{D) 


ECSQ 


2 


4 


8 


16 


32 


64 


128 


256 


512 


0.5 


2.29 


2.50 


2.55 


2.61 


2.64 


2.69 


2.69 


2.76 


2.76 


3.01 


2.10 


1 


5.06 


5.53 
(5.33) 


5.61 


5.66 


5.71 
(5.64) 


5.74 


5.74 


5.85 


5.85 


6.02 


4.64 


2 


11.08 


11.50 
(11.15) 


11.63 


11.69 


11.72 
(11.37) 


11.77 


11.77 


11.82 


11.84 


12.04 


10.55 


3 


17.11 


17.55 
(16.71) 


17.64 


17.70 


17.74 
(16.96) 


17.80 


17.80 


17.85 


17.87 


18.06 


16.56 



TABLE VI 

Laplacian source. Data for NTSC [13] in parenthesis. 



Rate 


A 


Trellis states 


H{D) 


ECSQ 


2 


4 


8 


16 


32 


64 


128 


256 


512 


0.5 



0.25 


2.92 
3.06 


3.03 
3.15 


3.06 
3.17 


3.09 
3.18 


3.11 
3.20 


3.14 
3.22 


3.17 
3.23 


3.18 


3.21 


3.54 


3.11 


1 



0.25 


5.69 
5.87 


6.05 
6.14 
(5.93) 


6.14 
6.20 


6.18 
6.27 


6.23 
6.33 
(6.07) 


6.29 
6.40 


6.30 
6.42 


6.33 


6.33 


6.62 


5.76 


2 




0.125 


11.68 
11.74 


12.15 
12.15 
(11.49) 


12.22 
12.24 


12.29 


12.32 
(11.72) 


12.38 


12.41 


12.43 


12.44 


12.66 


11.31 


3 





17.74 


18.15 
(17.00) 


18.23 


18.29 


18.33 
(17.15) 


18.39 


18.41 


18.44 


18.44 


18.68 


17.20 



TABLE Vn 

Generalized Gaussian source with a = 0.5. Data for NTSC [13] in parenthesis. 



Rate 


A 


Trellis states 


H{D) 


ECSQ 






2 


4 


8 


16 


32 






0.5 





4.74 


4.81 


4.83 


4.83 


4.84 


5.62 


5.37 




0.5 


5.19 


5.22 


5.23 


5.23 


5.24 






1 





8.00 


8.13 


8.18 


8.22 


8.22 


9.21 


8.61 




0.5 


8.53 


8.53 


8.62 


8.64 


8.64 












(8.11) 






(8.29) 






2 





14.31 


14.71 


14.80 


14.90 


14.94 


15.60 


14.58 




0.25 


14.56 


14.90 


15.03 


15.10 


15.14 












(14.13) 






(14.47) 






3 





20.54 


20.96 


21.04 


21.10 


21.16 


21.70 


20.49 




0.25 


20.62 


21.06 


21.14 


21.20 


21.26 












(19.61) 






(19.92) 
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Appendix 

Comments to formulation of Theorem [B 

In [14] expressions (fT3l) and (fT4l) were given in the following form 



1 



dx 



Q Jo 9[s,x) 



(23) 



^o(g) = 7^(-^-- ring{l/{2do),x)dx). (24) 
IngV 2 gJo ^ 

The simplification is due to the observation that g{s,x) is a periodical function of x with period 1 and 
inside the interval [0, 1] the function g{s,x) is symmetric with respect to the middle point x = 1/2, i.e., 
gis,x) = g{s, 1 - x). 
Proof of Corollary [B 

Here we analyze the asymptotic behavior of the estimate of Gn{q) given by Theorem [T] when q ^ oo. 
From definitions (fTTI) and (flSl) for a; G [0, 0.5] we obtain 

(^X /c) , /c = 0, ^ 2 

(fc-a:-g)^ fc=2±i,...,g-l 

— -1 

1 ^ _^ 1 |- ^sik-.-,r (26) 



= <! )l J ^^2 .I.Vr' ^ . (25) 



g[s,x) 



q — ' q 

^ fc=0 ^ fc=2+i 



9-1 
2 



- ^ e^(^-^)',x e [0,0.5]. (27) 



q 



To prove the Corollary, we choose 



s = -- (28) 



q 

The. first step is to verify that for large q this choice makes (fT3l) valid, i.e.. 

The second step is to substitute this expression into (fT4l) and show that for large g we have Ro{q) 1/2. 
After these two steps we immediately obtain the main result, 

lim Goo(g) = lim doq^^^^'^^-^^ = — . 

g^oo q^oo 2716 

We start the first step of derivations by demonstrating that asymptotically for large q the generating 
function g{s,x) does not depend on x. By straightforward computations it is easy to verify that 

9-1 

max{g{s,x)} = g{s,0) = - V e^^^' (30) 
X q ^-^^^ 

k= — 

9-1 

min{^7(s,x)} = g{sM2) = - V e^(2/c-i)V4_ (31) 

a; q 
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In order to estimate these sums we will use [22], formula 552.6, that is. 



For large q we obtain from (l30l) that 

1 2 °° 

1 2/1 
2 



1 
2 



(32) 



TT 
S 



Substituting (|28l ) and upper-bounding the sum by the geometric progression for g > 2, we obtain the 
upper bound 

^ (33) 



gis^ x) < maxfyffs, x) < — — + e 



To obtain a lower bound on the sum in (|3TI) we rewrite it in the form 



2 



sg2/4 



Notice that the integer (2A; — 1) in the exponent of summands runs over odd values. If we add terms 
corresponding to intermediate even values the sum increases less than twice. Therefore, 



^1 



■t 1 ^ OO _ CO 

1/2) > - ^ e^^'/^ = -Y, e^^'" 

^ ] 1 ^ ; 1 ^ 1 I 



k=l fc=l ^ k=g+l 

Next we estimate second sum by the geometric progression and estimate first sum using ( |32] ). After 
straightforward computations we obtain 



g{s,x) > mmg{s,x) > 



1 



1 



qe q 



(34) 



We can substitute (|33l) and (|34l) into (fT3l) to obtain lower and upper bounds on (io(Q')- In both cases we 
have to estimate the following integral 



.1/2 g 

2 I —g{s,x)dx 



9-1 
"1/2 1 2 



f - y {x-kfe<^-'^"dx 



9-1 

2 ;.-/c+l/2 



fc-l/2 



1 2 



q J-g/2 

^ erf r - ^e-"f^ 
27re3/2''^' 1^ 2 ) 47re ' 
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From inequalities ([22], formula 592) 



e 



1 — < erf (x) < 1 



follow the bounds 

^ - ^e-^ < 2 /■ -qis x)dx < 
Substituting them together with (|33] ) and ( |34l) into ( fT3] ) we obtain 



2716 



(i-o(,v.))<4<^(iH-o(-L)) 



Therefore (|29l) is proven and the first step completed. 

To fulfil the second step and the proof of Corollary, we have to substitute the bounds (|33] ) and (|34l ) 
into expression (fT4l) . After simple derivations we immediately obtain (flTl) ■ 
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